Scaling of degree correlations and the influence on diffusion in scale-free networks 
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Connectivity correlations play an important role in the structure of scale-free networks. While 
several empirical studies exist, there is no general theoretical analysis that can explain the largely 
varying behavior of real networks. Here, we use scaling theory to quantify the degree of correlations 
in the particular case of networks with a power-law degree distribution. These networks are classified 
in terms of their correlation properties, revealing additional information on their structure. For 
instance, the studied social networks and the Internet at the router level are clustered around 
the line of random networks, implying a strongly connected core of hubs. On the contrary, some 
biological networks and the WWW exhibit strong anti-correlations. The present approach can be 
used to study robustness or diffusion, where we find that anti-correlations tend to accelerate the 
diffusion process. 
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The topological structure of complex networks is 
largely determined by the way in which the constituent 
units are interconnected. Correlations in the connectivity 
of complex networks have been proved to be important 
and have been used to explain the functionality, robust- 
ness, stability, and structure of networks from Biology [l| 
and Sociology [2| to Computer Science [3| . A study of the 
correlation profile in a network of protein-protein inter- 
actions revealed that links from hubs to non-hub nodes 
are favored [l|, a result with consequences for the sta- 
bility and the modularity in biological networks. In the 
case of social networks, Newman has shown that most 
of them are assortative (i.e. hub-hub correlations dom- 
inate the system) |2|, and Colizza et al. demonstrated 
the 'rich-club' phenomenon where all hubs tend to form 
a connected cluster [4(. As a result, social networks are 
more difficult to immunize and diseases can spread fast. 

Recently, it was also shown that hub anticorrelations, 
i.e. the tendency of the hubs not to be directly connected 
with each other, give rise to fractal networks [5| , such as 
the undirected (symmetrized) WWW, the protein homol- 
ogy network 0] and other biological networks. On the 
contrary, when there is a large probability of direct hub 
connections the resulting networks, such as the Internet, 
the cond-mat co-authorship and other social networks, 
are non- fractals In this category falls also the ran- 
dom configuration model [H, 0| . 

An important topological feature of complex networks 
is the degree distribution P(k), where k is the number 
of links for a given node. Although the form of P(k) 
has a direct influence on the network properties, it can- 
not convey all the information for the network structure. 
Thus, two networks can have the same distribution P(k) 
but with completely different topologies, determined by 
the presence of degree correlations. This structure can 
be captured by the probability P(k±, k%) that two nodes 
of degree k\ and k 2 are connected to each other, and by 
quantities derived from P{k\,k2), such as the Pearson 



coefficient r, the average degree of nearest neighbors fc nn , 
etc. 

Despite their importance, a general theoretical frame- 
work to describe and characterize degree correlations in 
scale-free networks is still work in progress. For an at- 
tempt to describe correlations using a master equation 
approach see [l(|. Here, we find that the degree corre- 
lations in the studied scale-free networks can be charac- 
terized in terms of a correlation exponent e, which we 
calculate using a renormalization approach. This allows 
us to propose a classification of a set of dissimilar net- 
works, according to the degree of correlations into a small 
number of different classes in a "phase diagram" . For 
example, biological networks and the WWW are in the 
strong anti-correlations part of the diagram, while social 
networks and the Internet are clustered near the region of 
random networks. We show how we can use these ideas 
to explore more network properties, such as diffusion and 
robustness, which depend on the degree of correlations in 
the network. 

We start by recalling the renormalization of a network 
under a scale transformation. The renormalization pro- 
cedure tiles a network according to the box-covering al- 
gorithm (llj , with the minimum number of boxes where 
the maximum distance in any box is less than £b- Each 
box is subsequently replaced by a node, and links are 
established between these new 'super-nodes' if at least 
one node included in a box was connected to any node 
of the other box. These boxes are treated as the nodes 
of the renormalized network. Renormalization is a reli- 
able method for determining how the network behaves at 
different length scales. Self-similarity is then obtained if 
the network structure remains invariant under the renor- 
malization. 

Alternatively, we can retain multiple links between the 
boxes when we renormalize a network 12j. As we show 



below, though, this is not a strong effect, mainly due to 
preservation of the self-similar structure under renormal- 
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FIG. 1: The joint degree distribution P(k 1 ,k 2 ) of WWW (top 
row) and Internet at the router level (bottom row) before 
renormalization (left), after renormalization forbidding mul- 
tiple links (center), and including multiple links (right). 

ization. In contrast, during a random rewiring process 
degree correlations are destroyed, but bias is introduced 
if multiple links are forbidden. 

We use renormalization and scaling theory to deter- 
mine the form of P(ki,k 2 ). Since the self-similarity 
of a scale-free network requires the invariance of the 
degree distribution P(k), a power-law distribution of 
P(k) ~ fc~ 7 , where 7 is the degree exponent, is the only 
form that can satisfy this condition 5] . Taking this idea 
one step further, it is interesting to clarify whether corre- 
lations between degrees, as expressed by the joint degree 
distribution P(k\,k 2 ) 1 also remain invariant. In Fig. [1] 
we present an example of this distribution before and af- 
ter renormalization for the WWW and the Internet at 
the router level (similar results are derived for other net- 
works, as well). Allowing multiple links between boxes 
does not significantly modify the result. The statistical 
similarity of the corresponding plots suggests the invari- 
ance of P(ki, k 2 ). Accordingly, this suggests that the k\ 
and k 2 dependence can be separated and the behavior of 
the tail of the joint degree distribution is: 

P(h,k 2 ) ~ k-^- 1] k- f - (h > k 2 ). (1) 

The value of the first exponent 7 — 1 in Eq. ([T]) is obtained 
from the density conservation law: J P(k%, k 2 )dk 2 = 
fciP(fci) ~ fc x ^ 1 . Equation |T]) is also consistent with 
the known result for completely random networks 

P{h, k 2 ) ~ fciF(fc 1 )fc 2 P(fc 2 ) ~ ki^k 1 ^ , (2) 

i.e. the exponent e for these networks is e ran d =7 — 1, 
as expected from the symmetry in this case. Eq. (JlJ is 
also consistent with previous findings in Ref. [l(. The 
probability distribution for the neighbor connectivity in 
the yeast protein interaction network was there shown 



to behave differently for low-degree nodes, where the k- 
dependence was fc 1 ^ 7 , and for large-degree nodes, with a 
fc~ 7 dependence. Using Eq. fT} we can see that integrat- 
ing over k 2 for low-degree nodes (fci > k 2 ) we retrieve the 
fc 1-7 dependence. For the case of hubs, where integration 
is over k\, the dependence on the degree is k~ e and for 
the yeast protein interaction network we have calculated 
that e = 7 (see Fig. [3]). These results are in agreement 
with the observed behavior in [l|. 

Next, we introduce a scale- invariant quantity Eb(k) to 
simplify the estimation of e, even for small networks. 
We are motivated to introduce this quantity by asking 
whether a node is significantly linked to more connected 
nodes, i.e. a node considers another node as a 'hub' if its 
degree is much larger than its own. We define the ratio 

, . f™P(k\k')dk' 

as the measure of a node's preference to connect to neigh- 
bors with degree larger than bk (b is an arbitrary positive 
number, and large b corresponds to the identification of 
the hubs) 13]. From the scaling of E b (k) with k, we 
are able to obtain the exponent e in a simpler way than 
using P{k\,k 2 ), which presents more fluctuations than 
the average quantity E b (k). The conditional probability 
is P(k\k') = P{k,k')/ J P(k,k')dk = P(fc, fc')/ fc,1 ~ 7 = 
fc-(7-i)^,'-(i+e-7) p We find for a scale-free distribution: 

E b (k) ~k^~~<K (4) 

We have verified that the scaling of E b {k) remains invari- 
ant under renormalization. The same scaling exponents 
are recovered for the renormalized networks, even when 
multiple links are allowed between two boxes (Fig. [2] in- 
set). In the latter case, the renormalized nodes have in 
general larger degrees, which means that deviations ap- 
pear in nodes of smaller degree. Additionally, e was found 
to be independent of the value of b. We notice that other 
quantities derived from P{k\,k 2 ) may not be invariant 
under renormalization, such as r or fc nn , and therefore 
are not suitable to distinguish fractal from non-fractal 
networks. 

In Fig. [2] we present the behavior of E b (k) for the 
WWW, protein homology, Internet (router level) and 
cond-mat authorship. The existence of a scaling rela- 
tion over a k range, combined with the invariance of this 
curve, support Eq. (j4|) and the form used for P(ki,k 2 ) 
in Eq. JT]). The WWW and the protein homology net- 
work have been shown to have a fractal topology. The 
slope of Et,(k) with k is small or negative in these cases 
with values of e = 2.5 and e = 2.4, respectively. This 
behavior is in contrast with the two non-fractal networks 
in the figure, i.e. the Internet at the router level and the 
cond-mat co-authorship network, where Eb(k) increases 
almost linearly with increasing k. For these networks we 
find that e = 1.2 and e = 1.6, respectively. 
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FIG. 2: Plot of E b (k) versus k for the WWW, protein ho- 
mology, Internet at the router level and cond-mat network. 
Different topologies correspond to different scaling behavior 
with the degree k. Inset: Plot of Eb(k) versus k for a) the 
Internet, b) the renormalized Internet network without multi- 
ple links between two nodes, and c) the renormalized Internet 
network allowing multiple links between two nodes. The data 
have been vertically shifted in order to show the invariance. 
In b) and c) we use the MEMB method [llf with tb = 3, and 

In order to interpret the values of e we now turn to the 
renormalization scheme [5J. After renormalization the 
number of nodes TV in the network and the degree of a 
node k scale with £b as power laws with fractal exponent 
d,B and degree exponent dk , respectively (we use a prime 
to describe quantities measured in the renormalized net- 
work): 

N -> N 1 ~ £ B dB N , k -> k' ~ £ B dk k . (5) 

If ds and d/~ are finite, the network is fractal. If ds — > oo 
and dk —> oo (or equivalently the decay is exponential or 
faster) the network is not fractal. 

After tiling the network with boxes of diameter ig, 
each of these boxes have one unique local hub (i.e. the 
largest degree node in the box). Considering all possible 
pairs of boxes, we introduce the probability £{£b) that 
there exists a direct connection between the two hubs of 
any two boxes. We have shown (see e.g. Figs. 2e, 3d of 
Ref. 0) that the probability £ scales with the length £b 
as 

£(£ B ) ~ £ B d " . (6) 

Below we relate the exponent e to the hub-hub repulsion 
through the hub correlation exponent d e , which is crucial 
for fractality. 

The conservation of links in the renormalized network 
leads to the expression 

NP(ki, k 2 )dk 1 dk 2 = £(£ B )N'P'{k' 1 ,k' 2 )dk[dk' 2 . (7) 



FIG. 3: Classification of scale- free networks [T(|. We use 
the correlation exponent e in order to quantify the degree 
of correlations and the fractality of a network, as a function 
of 7. The line e ra nd =7 — 1 corresponds to a completely 
random network structure. The line e — 2 separates fractal 
(e > 2) from non- fractal networks (e < 2), while the line 6 = 7 
describes a fractal tree [7(. The four schematics illustrate 
networks where hub correlations are stronger than in random 
networks (area I), weaker than random but non-fractal (area 
II), non- fractal according to the minimal model of 0] (e = 2), 
and fractal (area III). 

Using Eqs. (JTJ, ©, ©, and © we get the relation 
£ d B B £ d B -£ [ B ~~ 1 " i)dk = 1 which finally leads to 

e = 2 + 4A4 = 2+ (7-1)^, (8) 

ds 

where we have substituted the value 7 = 1 + dg/dk- This 
relation of e with d e shows that correlations between the 
hubs of the boxes determine the correlations for all de- 
grees, in accordance with the invariance under renormal- 
ization. 

The direct determination of e through the slope of 
E},{k) vs k enables us to construct a 'phase diagram' in 
the plane (e, 7), shown in Fig. [3] This plot is classifying 
the studied networks in classes according to their degree 
of correlations, even though they correspond to dissimilar 
systems in biology, sociology or technology. 

As shown in Eq. p]). the exponent for a random net- 
work corresponds to the random line e ra nd = 7 — 1) which 
is verified in the plot for different 7 values of the config- 
uration model. In random network models, correlations 
arise because links are selected for connecting with each 
other equiprobably, so that the probability of two hubs 
being connected is large [lij ]. Thus, networks that are 
close to the line e ran d = 7 — 1 exhibit hub-hub corre- 
lations. The random line separates the diagram in two 
main parts: (a) above the line where the hub correlations 
tend to become weaker, and (b) below the line where net- 



works have even larger correlations (hubs are connected 
to each other with even higher probability than the one 
corresponding to a randomly created structure). 

In the diagram, the social networks and the Internet at 
the router level are clustered around the line e ran( j = 7—1. 
This is an indication that there is a strongly connected 
core of hubs in these systems, consistent with previous 
studies. The biological networks and the WWW, on the 
other side, are far away from the random line. This im- 
plies that there is a richer structure in these networks 
with hubs separated from each other. The distance in the 
plot from e ran d quantifies how different from randomness 
the network structure is, in terms of degree correlations. 

As e increases from e ran d, we expect that at some 
point the networks will become fractal, due to increased 
hub-hub correlations. The point of emergent fractality 
is found through Eq. (|SJ), where the borderline case of 
ds — > 00 yields e — 2. Indeed, we have verified via direct 
measurements of ds that all the networks above the line 
e = 2 in Fig. [3] are fractals. 

Thus, starting from e ran d we can separate the phase 
space into areas where the hub correlations are stronger 
than in random models (area I) or weaker than that (ar- 
eas II and III). The weak correlation areas II and III 
are further divided by the line e = 2 which determines 
whether the anticorrelations are strong enough to result 
in a fractal network (III) or not (II). 

An immediate result from this diagram is the differ- 
ent position of the Internet at the router level compared 
to the AS level [l^|. Although the degree distribution 
of these two networks is the same (7 = 2.2), the corre- 
lation exponent e reveals that there are more hub-hub 
connections at the router level, similar to the case of a 
random network. Contrary to that, the AS level exhibits 
a structure with less correlations deviating from that of 
a simple random model. This difference may hint on dif- 
ferent design principles or requirements at varying levels 
of the Internet. 

The above approach can be directly applied to explore 
many interesting properties, such as network robustness, 
synchronization, or diffusion processes. Until now, the- 
oretical studies have been limited to using the uncorre- 
cted version of P(k'\k) ~ k'P(k'). The introduction of 
Eq. (JTJ) enables us to substitute this form and generalize 
the problem for networks with known correlation expo- 
nents. For example, we can study the effect of correla- 
tions on diffusion by starting with the master equation 
for the density of particles p(k, t) on nodes with degree k 
at time t 



dp(k, t) 
dt 



-p(k,t) 



P (k',t) 



(9) 
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FIG. 4: Influence of correlations on diffusion, for scale-free 
networks with 7 = 2.75. The presented e values correspond to 
different areas in Fig. 3. Decreasing hub-hub correlations (top 
to bottom) leads to faster convergence towards equilibrium, 
i.e. diffusion is accelerated. 



and substitute P{k'\k) with a form derived through 
Eq. (jT|). The Laplace transform of the above equation 
leads to a Fredholm integral equation of the second kind 



with separable kernel that can be solved analytically. We 
define the quantity K(t) = (k x (t)} 1 / x , where x = 7— 1 — e, 
and K(t) serves as a measure of the diffusing parti- 
cles preference to larger or smaller degrees k. The an- 
alytical result for K x (t), defined as n x (t) = (k x )(t), is 
Kx (t) — + (k q - Koo)e _ct , where = (7 - 2)/ (e - 1) 
and Ko = (7 — l)/e are constants depending on 7 and e, 
while c = (e - l) 2 /((7 - 2)(2e - 7)). The result for K(t) 
is displayed in Fig. 2] for networks with 7 = 2.75. The 
exponential convergence to the asymptotic steady-state 
configuration depends on the value of c, which increases 
with the exponent e. Networks that are close to the ran- 
dom case e ra nd = 7 — 1 require longer times for reaching 
the equilibrium state and the diffusing particles prefer to 
occupy larger degree nodes. A larger e value enhances 
anti-correlations in the network and the particles move 
faster occupying smaller degree nodes on the average. We 
can infer, thus, that stronger correlations tend to speed 
up the diffusion process. The mechanism behind this be- 
havior is as follows: when the hubs are directly connected 
to each other the particles tend to remain localized in the 
neighborhood around these hubs, so that it takes longer 
for them to explore wider areas. On the contrary, when 
hub anti-correlations are important the particles spend 
most of their time in the intermediate areas which are 
formed by smaller degree nodes and which connect indi- 
rectly the hubs to each other. 
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Havlin and support from NSF grants. 
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